This page documents RapidMiner Studio version 10.0.

Generate Session ID (Operator Toolbox)

Synopsis

This operator creates a new attribute containing a consecutive ID that identifies the session each example belongs to.

Description

A session is a series of examples whose consecutive dates lie less than the maximum gap apart. Both the gap threshold and its time unit can be configured. To process each session separately, a possible subsequent operator is Group Into Collection.
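The session definition above can be illustrated with a minimal Python sketch. It is not the operator's actual implementation: the function name assign_session_ids is hypothetical, the input is assumed to be sorted in ascending order, IDs are assumed to start at 0, and a gap exactly equal to the threshold is assumed to stay in the same session.

```python
from datetime import datetime, timedelta


def assign_session_ids(dates, max_gap):
    """Return one session ID per date (hypothetical helper).

    Assumptions: dates are sorted ascending, IDs start at 0, and a gap
    equal to max_gap still belongs to the current session.
    """
    session_ids = []
    current = 0
    previous = None
    for d in dates:
        # Start a new session when the gap to the previous date is too large.
        if previous is not None and d - previous > max_gap:
            current += 1
        session_ids.append(current)
        previous = d
    return session_ids


dates = [
    datetime(2023, 1, 1, 10, 0),
    datetime(2023, 1, 1, 10, 20),
    datetime(2023, 1, 1, 12, 0),
    datetime(2023, 1, 1, 12, 5),
]
ids = assign_session_ids(dates, timedelta(minutes=30))
print(ids)  # [0, 0, 1, 1]
```

With a 30-minute threshold, the 100-minute pause between 10:20 and 12:00 starts the second session.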

Input

  • exa (Data Table)

    The input ExampleSet.

Output

  • out (Data Table)

    The resulting ExampleSet with the new session ID attribute.

  • ori (Data Table)

    The original ExampleSet.

Parameters

  • date_attribute The name of the attribute that contains the date to be used.
  • gap_threshold The maximum difference between two dates that still belong to the same session.
  • gap_unit The unit of the gap threshold. If "none" is selected, the operator can also be applied to numerical attributes. The threshold is specified without a sign (i.e. as an absolute value).
  • use_absolutes If activated, the absolute difference between two examples is used.
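How gap_unit "none" and use_absolutes might interact on a numerical attribute can be sketched as follows. This is an illustration under assumptions, not the operator's documented behavior: the function assign_numeric_sessions is hypothetical, and it assumes that without use_absolutes the signed difference (current value minus previous value) is compared against the threshold, so a large decrease does not start a new session.

```python
def assign_numeric_sessions(values, gap_threshold, use_absolutes=False):
    """Assign session IDs to numerical values (hypothetical helper).

    Assumption: the signed difference to the previous value is compared
    with gap_threshold unless use_absolutes is set.
    """
    session_ids = []
    current = 0  # assumption: IDs start at 0
    for i, value in enumerate(values):
        if i > 0:
            diff = value - values[i - 1]
            if use_absolutes:
                diff = abs(diff)
            if diff > gap_threshold:
                current += 1
        session_ids.append(current)
    return session_ids


values = [1, 2, 10, 3]
signed = assign_numeric_sessions(values, 5)
absolute = assign_numeric_sessions(values, 5, use_absolutes=True)
print(signed)    # [0, 0, 1, 1]
print(absolute)  # [0, 0, 1, 2]
```

In this sketch the drop from 10 to 3 only opens a new session when absolute differences are used.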

Tutorial Processes

Creating a session id on dummy data

In this example we first create a dummy data set with the Create ExampleSet operator. This data set contains two sessions: one in January and one in February. The Generate Session ID operator generates a session ID for each of them. The operator can also be run on numerical attributes; in that case, select "none" as the gap unit. The threshold is specified without a sign (i.e. as an absolute value).
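A rough Python analogue of this tutorial, including the per-session grouping that Group Into Collection would provide, might look like the sketch below. The dates and the 7-day threshold are made up for illustration and are not the values used in the tutorial process.

```python
from collections import defaultdict
from datetime import date, timedelta

# Hypothetical dummy data: three dates in January, two in February.
dates = [
    date(2023, 1, 2),
    date(2023, 1, 3),
    date(2023, 1, 5),
    date(2023, 2, 10),
    date(2023, 2, 11),
]
max_gap = timedelta(days=7)  # assumed threshold, not taken from the tutorial

# Collect the dates of each session under its session ID.
sessions = defaultdict(list)
session_id = 0  # assumption: IDs start at 0
for i, d in enumerate(dates):
    if i > 0 and d - dates[i - 1] > max_gap:
        session_id += 1
    sessions[session_id].append(d)

for sid, members in sessions.items():
    print(sid, [m.isoformat() for m in members])
```

The January dates end up in one group and the February dates in another, so each session can then be processed on its own.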